AMENDMENTS TO THE CLAIMS

This listing of claims replaces all prior versions and listings of claims in the 
application. 

IN THE CLAIMS 

1. (Currently amended) A reduced instruction set computer architecture implemented
on a programmable logic device, comprising: 

a parallel bit shifter capable of reversible shifts and bit reversals; 

a Reed-Muller Boolean unit coupled to the parallel bit shifter; [[and]] 

an immediate instruction function that, via constant modes, variously 
manipulates distribution of a set of literal bits of a half-word literal field from an 
instruction word across a full-length data word; and

wherein at least one of the constant modes causes each bit of the half-word
literal field to be duplexed into two adjacent bits in the full-length data word.

2. (Original) The reduced instruction set computer architecture of claim 1, wherein
the parallel bit shifter is a parallel 32-bit shifter. 

3. (Original) The reduced instruction set computer architecture of claim 1, wherein
the parallel bit shifter comprises logic ranked into four sections. 

4. (Original) The reduced instruction set computer architecture of claim 3, wherein 
the parallel bit shifter further comprises a split shift direction control signal applied on 
at least two of the four sections. 
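
As a non-limiting illustration of the shifter recited in claims 1 to 4, one way to obtain a shift in the opposite direction from a single left-shifting network is to bit-reverse the operand before and after the shift. The Python sketch below assumes a 32-bit word (claim 2); the function names are hypothetical, and neither the four-section ranking of claim 3 nor the split direction control of claim 4 is modeled.

```python
WORD_MASK = 0xFFFFFFFF  # assumed 32-bit data word


def bit_reverse32(x):
    """Return x with its 32 bits in reversed order (bit 0 <-> bit 31)."""
    out = 0
    for i in range(32):
        out = (out << 1) | ((x >> i) & 1)
    return out


def shift_left32(x, n):
    """Logical left shift, truncated to 32 bits."""
    return (x << n) & WORD_MASK


def shift_right_via_reversal(x, n):
    """Logical right shift built from reverse, left shift, reverse."""
    return bit_reverse32(shift_left32(bit_reverse32(x), n))
```

Under these assumptions, `shift_right_via_reversal(0x80000000, 4)` yields `0x08000000`, the same value as a direct logical right shift by four.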

5. (Original) The reduced instruction set computer architecture of claim 1, wherein
the Reed-Muller Boolean unit performs any Boolean operation with two operands and 
an inverse of the Boolean operation on data words by changing only a single control 
bit value.

6. (Original) The reduced instruction set computer architecture of claim 1, wherein
the Reed-Muller Boolean unit multiplexes between two input operands by 
transposition of lone control bits. 
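
As a non-limiting illustration of claims 5 and 6, the sketch below assumes the unit evaluates the positive-polarity Reed-Muller expansion f = c0 XOR (c1 AND a) XOR (c2 AND b) XOR (c3 AND a AND b) bitwise on 32-bit words. The function name and the four control-bit names c0 to c3 are hypothetical and do not appear in the claims.

```python
WORD_MASK = 0xFFFFFFFF  # assumed 32-bit data word


def reed_muller(a, b, c0, c1, c2, c3):
    """Bitwise f = c0 ^ (c1 & a) ^ (c2 & b) ^ (c3 & a & b) on 32-bit words."""
    out = 0
    if c0:
        out ^= WORD_MASK  # constant term: complements every output bit
    if c1:
        out ^= a
    if c2:
        out ^= b
    if c3:
        out ^= a & b
    return out & WORD_MASK
```

With this encoding, the control tuples (0, 0, 0, 1), (0, 1, 1, 1) and (0, 1, 1, 0) give AND, OR and XOR, and flipping only c0 gives NAND, NOR and XNOR. The sixteen possible tuples cover all sixteen two-operand Boolean functions. Setting only c1 passes operand a and setting only c2 passes operand b, so exchanging those two bits selects between the operands.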

7. (Original) The reduced instruction set computer architecture of claim 1, wherein
the immediate instruction function treats the half-word literal field as one among a
lower-half word, an upper-half word, a zero-filled word, a one-filled word, a
sign-extended word, or a replicated 1-bit for 2-bits word.
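
As a non-limiting illustration of the literal treatments listed in claim 7, the sketch below expands a 16-bit literal into a 32-bit word. The mode names are borrowed from the mnemonics of claim 8. The fill behavior shown here (zeros in the unused half for LOW and HIGH, ones for FILL_LOW and FILL_HIGH) is an assumption, as are the function names.

```python
def duplex(lit16):
    """Copy each of the 16 literal bits into two adjacent bits of a 32-bit word."""
    out = 0
    for i in range(16):
        if (lit16 >> i) & 1:
            out |= 0b11 << (2 * i)
    return out


def expand_literal(lit16, mode):
    """Expand a half-word literal to a full word (assumed mode semantics)."""
    lit16 &= 0xFFFF
    if mode == "LOW":        # literal in lower half, upper half zero
        return lit16
    if mode == "HIGH":       # literal in upper half, lower half zero
        return lit16 << 16
    if mode == "FILL_LOW":   # literal in lower half, upper half ones
        return 0xFFFF0000 | lit16
    if mode == "FILL_HIGH":  # literal in upper half, lower half ones
        return (lit16 << 16) | 0x0000FFFF
    if mode == "SIGN":       # bit 15 replicated into the upper half
        return lit16 | (0xFFFF0000 if lit16 & 0x8000 else 0)
    if mode == "DUPLEX":     # replicated 1-bit for 2-bits
        return duplex(lit16)
    raise ValueError(f"unknown mode: {mode}")
```

Under these assumptions, `expand_literal(0x000F, "DUPLEX")` yields `0x000000FF` and `expand_literal(0x8001, "DUPLEX")` yields `0xC0000003`.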

8. (Original) The reduced instruction set computer architecture of claim 7, wherein 
the constant modes of the immediate instruction function include composite 
immediate instructions selected from the group of instructions comprising 
AND_FILL_LOW, OR_LOW, XOR_LOW, ADD_LOW, AND_FILL_HIGH, OR_HIGH,
XOR_HIGH, ADD_HIGH, AND_DUPLEX, OR_DUPLEX, XOR_DUPLEX,
ADD_DUPLEX, AND_SIGN, OR_SIGN, XOR_SIGN, and ADD_SIGN.

9. (Currently amended) A system-on-chip, comprising: 

a reduced instruction set computer processor implemented on a field 
programmable gate array fabric; and 

an [[a simple and balanced]] instruction set utilizing a [[minimal]] amount of
resources from the field programmable gate array fabric, wherein the processor is 
synthesizable from hardware description language, 

wherein the instruction set consists of 32 instructions, and wherein each 
instruction of the instruction set is a same size; and

wherein one instruction of the instruction set is an immediate instruction that,
via constant modes, variously manipulates distribution of a set of literal bits of a
half-word literal field from an instruction word across a full-length data word, and at least
one of the constant modes causes each bit of the half-word literal field to be duplexed
into two adjacent bits in the full-length data word.

10. (Original) The system-on-chip of claim 9, wherein the same size is 32 bits.

11. (Original) The system-on-chip of claim 9, wherein one instruction of the instruction
set is a reserved for future use (rfu) instruction. 

12. (Original) The system-on-chip of claim 9, wherein a series of bitfield layouts of 
machine words are arranged and constructed for compatibility with on-chip intellectual 
property cores. 

13. (Original) The system-on-chip of claim 12, wherein the series of bitfield layouts 
are arranged and constructed for use with standard processor buses. 

14. (Original) The system-on-chip of claim 9, wherein the processor includes at least 
16 general purpose registers and 7 special purpose registers. 

15. (Cancelled) 

16. (Original) The system-on-chip of claim 9, wherein bitfield layouts for instruction 
formats and instruction ordering are arranged for efficient use of FPGA resources. 

17. (Original) The system-on-chip of claim 9, wherein instruction formats include a 
balance of single and paired instructions well-suited for compilation, such that after a 
first-issued instruction is considered by a compiler for issue, the compiler beneficially 
considers no more than 30 alternative instructions for a second-issued instruction to 
pair with the first-issued instruction. 

18. (Original) The system-on-chip of claim 9, wherein instruction formats include a 
balance of single and paired instructions well-suited for a C runtime software 
environment, such that instruction sequences required to implement C language 
constructs naturally break into instruction sequences comprising one or two 
instructions.

19. (Original) The system-on-chip of claim 18, wherein the single and paired
instructions minimize interrupt latency. 

20. (Original) The system-on-chip of claim 18, wherein a minimal instruction overhead 
and the single and paired instructions facilitate interrupt driver software design directly 
in the C language. 

21. (Original) The system-on-chip of claim 9, wherein the processor is based on a
32-bit architecture.

22. (Original) The system-on-chip of claim 9, wherein the processor comprises at 
least one among a balanced instruction set; a set of six instruction forms; a
byte-addressed 32-bit address space; addressing by word, half-word, or byte; a little endian
byte ordering; and big endian byte ordering. 

23. (Currently amended) The system-on-chip of claim 9 [[8]], wherein the processor
is based on one among a 16-bit, a 32-bit, a 64-bit, a 128-bit, a 256-bit, a 512-bit and a
1024-bit architecture. 

24. (Currently Amended) A method of forming a reduced instruction set computer 
processor, comprising the steps of: 

embedding a processor core on a field programmable gate array; [[and]] 

deploying a simple instruction set optimized for a compiler, wherein the 
processor is directly synthesizable from a hardware description language, 

wherein the instruction set consists of 32 instructions, and wherein each 
instruction of the instruction set is a same size; and

wherein one instruction of the instruction set is an immediate instruction that,
via constant modes, variously manipulates distribution of a set of literal bits of a
half-word literal field from an instruction word across a full-length data word, and at least
one of the constant modes causes each bit of the half-word literal field to be duplexed
into two adjacent bits in the full-length data word.

25. (Original) The method of claim 24, wherein one instruction of the instruction set is
a reserved for future use (rfu) instruction. 

26. (Cancelled) 

27. (Original) The method of claim 24, wherein embedding the processor core 
comprises configuring the field programmable gate array to include a parallel bit shifter 
capable of reversible shifts and bit reversals and a Reed-Muller Boolean unit coupled 
to the parallel bit shifter. 

28. (Currently amended) A reduced instruction set computer architecture 
implemented on a programmable logic device, comprising: 

a parallel bit shifter capable of reversible shifts and bit reversals; 
a Reed-Muller Boolean unit coupled to the parallel bit shifter; and 
an immediate instruction function used in conjunction with the parallel bit shifter 
and Reed-Muller Boolean unit, wherein the immediate instruction function uses a 
single word instruction having N possible modes and having a plurality of instruction 
bits, with half of the plurality of instruction bits of the single word allocated to 
immediate data; 

wherein the immediate instruction function further includes a predetermined 
number of separate bit mask sub-modes, and the separate bit mask sub-modes are 
selected from the group of bit mask sub-modes comprising FILL LOW, FILL HIGH,
LOW, HIGH, DUPLEX, and SIGN bit masks; and

wherein the LOW bit mask sub-mode is used for at least one of inserting literal 
bits into a lower half-word of a general purpose register and inserting as one of a pair 
of instructions for a full-word literal, the HIGH bit mask sub-mode is used for at least 
one of inserting literal bits into an upper half-word of a general purpose register and 
inserting as one of a pair of instructions for a full-word literal, the DUPLEX bit mask 
sub-mode is used for creating nybble, byte and half-word mask values flexibly, and the
SIGN bit mask sub-mode is used for incrementing or decrementing a general purpose
register by a constant value when combined with an ADD operating mode.

29. (Original) The reduced instruction set computer architecture of claim 28, wherein 
the immediate instruction function has 16 possible modes.

30. (Cancelled) 

31. (Currently amended) The reduced instruction set computer architecture of claim
28 [[30]], wherein the immediate instruction function further includes a predetermined
number of separate arithmetic logical operating modes that are [[the separate operation
sub-modes are]] selected from the group [[of arithmetic logical operating modes]]
comprising AND, OR, XOR, and ADD, and

wherein the AND mode is used for at least one of zeroing unwanted literal bits 
and masking of an operand, the OR mode is used for inserting desired literal bits into 
a selected general purpose register, the XOR mode is used for complementing select 
bits of an operand in a general purpose register when necessary, and the ADD mode 
is used for immediate arithmetic operations. 
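
As a non-limiting illustration of how the operating modes of claim 31 might combine with the bit mask sub-modes of claim 28, the sketch below applies one operation to a register value and an expanded 16-bit literal. The mask semantics (zero or one fill of the unused half), the table names and the function name are assumptions, not taken from the claims.

```python
WORD_MASK = 0xFFFFFFFF  # assumed 32-bit data word

# Assumed expansion of a 16-bit literal for each bit mask sub-mode.
MASKS = {
    "LOW": lambda lit: lit,
    "HIGH": lambda lit: lit << 16,
    "FILL_LOW": lambda lit: 0xFFFF0000 | lit,
    "FILL_HIGH": lambda lit: (lit << 16) | 0x0000FFFF,
    "SIGN": lambda lit: lit | (0xFFFF0000 if lit & 0x8000 else 0),
    "DUPLEX": lambda lit: sum(0b11 << (2 * i) for i in range(16) if (lit >> i) & 1),
}

# The four arithmetic logical operating modes named in claim 31.
OPS = {
    "AND": lambda reg, mask: reg & mask,
    "OR": lambda reg, mask: reg | mask,
    "XOR": lambda reg, mask: reg ^ mask,
    "ADD": lambda reg, mask: reg + mask,
}


def immediate(reg, op, mask_mode, lit16):
    """Apply one operating mode to a register and an expanded literal."""
    mask = MASKS[mask_mode](lit16 & 0xFFFF)
    return OPS[op](reg & WORD_MASK, mask) & WORD_MASK
```

Under these assumptions, an OR with HIGH followed by an OR with LOW builds a full-word literal in a cleared register: `immediate(immediate(0, "OR", "HIGH", 0x1234), "OR", "LOW", 0x5678)` yields `0x12345678`. An ADD with SIGN and literal `0xFFFF` decrements the register by one, and an AND with FILL_LOW and literal `0x00FF` clears bits 8 to 15 while leaving the upper half-word untouched.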

32-34. (Cancelled) 

35. (Currently amended) A system-on-chip, comprising: 

a reduced instruction set computer processor implemented on a programmable 
logic device fabric; 

an [[a simple and balanced]] instruction set utilizing a [[minimal]] amount of
resources from the field programmable gate array fabric, wherein the processor is 
synthesizable from hardware description language; and 

a horizontally scalable immediate instruction using multiple vectorized versions 
of an N-bit architecture; and

wherein the reduced instruction set computer processor is a digital signal 
processor wherein the digital signal processor generates two N-bit words of constant 
data using an N/2 bit immediate instruction having N/4 bits dedicated for immediate
data.

36. (Original) The system-on-chip of claim 35, wherein the horizontally scalable 
immediate instruction concurrently uses multiple vectorized versions of the N-bit 
architecture. 

37. (Original) The system-on-chip of claim 35, wherein the N-bit architecture can be 
selected among multiple 16-bit, 32-bit, 64-bit, 128-bit, 256-bit, 512-bit and 1024-bit
vectorized versions. 

38. (Original) The system-on-chip of claim 35, wherein the N-bit architecture can be 
selected among multiple 18-bit, 36-bit, 72-bit, 144-bit, 288-bit, 576-bit and 1152-bit
vectorized versions. 

39. (Cancelled) 

40. (Currently amended) The system-on-chip of claim 35 [[39]], wherein the two N-bit 
words comprise constants for a Fast Fourier Transform. 

41. (Currently amended) The system-on-chip of claim 35 [[39]], wherein the two N-bit
words comprise in-phase and quadrature data for complex signal processing. 

42. (Currently Amended) The system-on-chip of claim 35, wherein the instruction set 
further comprises a reserved for future use (rfu) instruction that enables vertically 
scalable architectural variants implemented on the field programmable gate array 
fabric. 

43. (Original) The system-on-chip of claim 42, wherein the rfu instruction enables 
upward compatible single and double-precision floating-point arithmetic operations via 
the field programmable gate array fabric.

44. (Original) The system-on-chip of claim 43, wherein the rfu instruction uses a
single standard extension at a hardware/software boundary for variant architectures 
while minimally impacting a compiler design for the variant architectures. 